PWC-Net: CNNs for Optical Flow Using Pyramid, Warping, and Cost Volume
نویسندگان
چکیده
We design a compact but effective CNN model for optical flow by exploiting the well-known design principles: pyramid, warping, and cost volume. Cast in a learnable feature pyramid, our network uses the current optical flow estimate to warp the CNN features of the second image. It then uses the warped features and features of the first image to construct the cost volume, which is processed by a CNN network to decode the optical flow. As the cost volume is a more discriminative representation of the search space for the optical flow than raw images, a compact CNN decoder network is sufficient. Our model performs on par with the recent FlowNet2 method on the MPI Sintel and KITTI 2015 benchmarks, while being 17 times smaller in size and 2 times faster in inference. Our model protocol and learned parameters will be publicly available.
منابع مشابه
High Accuracy Optical Flow Method Based on a Theory for Warping: 3D Extension
This paper describes the implementation and qualitative and quantitative evaluation of a 3D optical flow algorithm, whose derivation is based on the 2D optical flow method published by Brox et al. [ECCV2004]. The optical flowminimizes an energy function built with three assumptions: a brightness constancy assumption, a gradient constancy assumption, and a smoothness assumption. Brox et al. mini...
متن کاملDevon: Deformable Volume Network for Learning Optical Flow
We propose a lightweight neural network model, Deformable Volume Network (Devon) for learning optical flow. Devon benefits from a multi-stage framework to iteratively refine its prediction. Each stage is by itself a neural network with an identical architecture. The optical flow between two stages is propagated with a newly proposed module, the deformable cost volume. The deformable cost volume...
متن کاملPyramid Stereo Matching Network
Recent work has shown that depth estimation from a stereo pair of images can be formulated as a supervised learning task to be resolved with convolutional neural networks (CNNs). However, current architectures rely on patch-based Siamese networks, lacking the means to exploit context information for finding correspondence in illposed regions. To tackle this problem, we propose PSMNet, a pyramid...
متن کاملTwo-Frame Optical Flow Formulation in an Unwarping Multiresolution Scheme
In this paper, we propose a new formulation of the Differential Optical Flow Equation (DOFE) between two consecutive images considering spatial and temporal information from both. The displacement field is computed in a Markov Random Field (MRF) framework. The solution is done by minimization of the Gibbs energy using a Direct Descent Energy (DDE) algorithm. A hybrid multiresolution approach, c...
متن کاملAdaptive Deep Pyramid Matching for Remote Sensing Scene Classification
Convolutional neural networks (CNNs) have attracted increasing attention in the remote sensing community. Most CNNs only take the last fully-connected layers as features for the classification of remotely sensed images, discarding the other convolutional layer features which may also be helpful for classification purposes. In this paper, we propose a new adaptive deep pyramid matching (ADPM) mo...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید
ثبت ناماگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید
ورودعنوان ژورنال:
- CoRR
دوره abs/1709.02371 شماره
صفحات -
تاریخ انتشار 2017